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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANTS : 



KattLa Georgopoulos 
Bruce Morgan 



10 



(ii) TITLE OF INVENTIC 



The Aiolos Gene 



(iii) NUMBER OF SEQUENCES: 22 



15 



20 



25 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: /LAHIVE & COCKFIELD 

(B) STREET: 601 State Street, Suite 510 

(C) CITY: Bostfon 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP : 0210^-1875 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTED: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

<D) SOFTWARE: Patentln Release #1.0, Version #1.25 



30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING/ DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: /Myers, Louis 

(B) REGISTRATION NUMBER:- 35,965 

(C) REFERENCE /DOCKET NUMBER: MGP-042-2 



40 



45 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)227-5941 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECUILE TYPE: cDNA 



55 



(ix) FEAT 
(A) 
(B) 



IE /KEY: CDS 
JOCATION: 374 . . 1895 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
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CACGAGCGCA CACCGCTCGG CTCTCCTTGC GACACGCCCT CATCCCCGGT GTTTCTCAAG 
60 

TAGACGTCCC GAGACGGTCG CTGAGGCACT GTTTCCACGC GATCAGGGTT CCTCAGGCTT 
120 

GACATTCAAA AGTGGGTGCG GAACCCGCGG CACTCGGAGC GTGCTTTAAA GCGGCCGCCA 
180 

GCCAGCGCCG CTCTAACCTC GCGCCCCGGC TGCCGGCGGC TCCCGCCCTG CATCTGCGCC 
240 

GACGCGACCG AGCGATCCCG GGGCCTCCCT GCGCCCGGAA TCTCCCGCCA GCCGCGCGGG 
300 

TCCCCACGGC AGCAGCACGT GGAGCGGCCG CGGAGCCTGA GCGACAGCTG CAGCCCGCGC 
360 

GGCCCGCGGC GAG ATG GAA GAT ATA CAA CCG ACT GTG GAG CTG AAA AGC 
409 

Met Glu Asp lie Gin Pro Thr Val Glu Leu Lys Ser 
15 10 

ACG GAG GAG CAG CCT CTG CCC ACA GAG AGC CCA GAC GCT CTG AAT GAC 
457 

Thr Glu Glu Gin Pro Leu Pro Thr Glu Ser Pro Asp Ala Leu Asn Asp 
15 20 25 

TAC AGC TTG CCC AAA CCT CAT GAG ATA GAA AAC GTG GAC AGT AGA GAA 
505 

Tyr Ser Leu Pro Lys Pro His Glu lie Glu Asn Val Asp Ser Arg Glu 
3 0 35 4 0 

GCC CCA GCC AAT GAA GAC GAA GAT GCA GGA GAA GAT TCG ATG AAA GTG 
553 

Ala Pro Ala Asn Glu Asp Glu Asp Ala Gly Glu Asp Ser Met Lys Val 

45 50 55 60 

AAA GAT GAA TAC AGC GAC AGA GAT GAG AAC ATT ATG AAG CCG GAG CCC 
601 

Lys Asp Glu Tyr Ser Asp Arg Asp Glu Asn lie Met Lys Pro Glu Pro 

65 70 75 

ATG GGA GAT GCA GAA GAG AGT GAA ATG CCT TAC AGC TAT GCA AGA GAA 
649 

Met Gly Asp Ala Glu Glu Ser Glu Met Pro Tyr Ser Tyr Ala Arg Glu 
80 85 90 

TAC AGC GAC TAT GAA AGC ATT AAG CTG GAG AGA CAC GTG CCC TAT GAC 
697 

Tyr Ser Asp Tyr Glu Ser lie Lys Leu Glu Arg His Val Pro Tyr Asp 
95 100 105 

AAC AGC AGA CCA ACC AGT GGG AAG ATG AAC TGC GAC GTG TGC GGG TTA 
745 

Asn Ser Arg Pro Thr Ser Gly Lys Met Asn Cys Asp Val Cys Gly Leu 
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110 115 120 

TCC TGC ATT AGC TTC AAC GTC TTG ATG GTT CAT AAG CGA AGC CAT ACC 
793 

Ser Cys lie Ser Phe Asn Val Leu Met Val His Lys Arg Ser His Thr 

125 130 135 140 

GGC GAA CGC CCG TTC CAG TGT AAT CAG TGC GGG GCA TCT TTT ACT CAG 
841 

Gly Glu Arg Pro Phe Gin Cys Asn Gin Cys Gly Ala Ser Phe Thr Gin 
145 150 155 

AAA GGT AAC CTC CTC CGT CAT ATT AAA CTG CAC ACG GGG GAA AAA CCT 
889 

Lys Gly Asn Leu Leu Arg His lie Lys Leu His Thr Gly Glu Lys Pro 
160 165 170 

TTT AAG TGT CAC CTC TGC AAC TAC GCA TGC CAA AGG AGA GAT GCG CTC 
937 

Phe Lys Cys His Leu Cys Asn Tyr Ala Cys Gin Arg Arg Asp Ala Leu 
175 180 185 

ACG GGA CAC CTT AGG ACA CAT TCT GTG GAG AAG CCG TAC AAG TGT GAG 
985 

Thr Gly His Leu Arg Thr His Ser Val Glu Lys Pro Tyr Lys Cys Glu 
190 195 200 

TTC TGC GGA AGA AGC TAC AAG CAG AGA AGC TCC CTG GAG GAG CAC AAG 
1033 

Phe Cys Gly Arg Ser Tyr Lys Gin Arg Ser Ser Leu Glu Glu His Lys 
205 210 215 220 

GAA CGC TGC" CGA GCT TTT" CTT "CAG AAC CCT GAC CTG GGG GAC- GCT GCA 
1081 

Glu Arg Cys Arg Ala Phe Leu Gin Asn Pro Asp Leu Gly Asp Ala Ala 
225 230 235 

AGT GTG GAG GCA AGA CAC ATC AAA GCC GAG ATG GGA AGT GAG AGA GCT 
1129 

Ser Val Glu Ala Arg His lie Lys Ala Glu Met Gly Ser Glu Arg Ala 
240 245 250 

CTC GTC CTG GAC AGA TTA GCA AGC AAT GTG GCT AAG CGA AAA AGC TCG 
1177 

Leu Val Leu Asp Arg Leu Ala Ser Asn Val Ala Lys Arg Lys Ser Ser 
255 260 265 

ATG CCT CAG AAA TTC ATC GGT GAG AAG CGG CAC TGC TTC GAT GCC AAC 
1225 

Met Pro Gin Lys Phe lie Gly Glu Lys Arg His Cys Phe Asp Ala Asn 
270 275 280 

TAC AAT CCC GGC TAC ATG TAC GAG AAG GAG AAC GAG ATG ATG CAG ACC 
1273 

Tyr Asn Pro Gly Tyr Met Tyr Glu Lys Glu Asn Glu Met Met Gin Thr 
285 290 295 300 



CGG ATG ATG GAC 
1321 

Arg Met Met Asp 



GAA GCC TTC CGC 
1369 

Glu Ala Phe Arg 
320 

ATG GTC CCA GTC 
1417 

Met Val Pro Val 
335 

GAT ATG CCA ATG 
1465 

Asp Met Pro Met 
350 

CTG CCA GAG AAG 
1513 

Leu Pro Glu Lys 
365 

AGT GCC CAG GAC 
1561 

Ser Ala Gin Asp 



CAT CTC TAC CAG 
1609 

His Leu Tyr Gin 
400 

ATG CCT CTT CTG 
1657 

Met Pro Leu Leu 
415 

CCT CCC ATC TGC 
1705 

Pro Pro lie Cys 
430 

GAG GTG ATG GAT 
1753 

Glu Val Met Asp 
445 

GAT TAT GTG ATG 
1801 

Asp Tyr Val Met 



CCC TTT GAG TGT 
1849 



CAA GCC ATC AAT 

Gin Ala lie Asn 
305 

CCC TTA GTC CAG 

Pro Leu Val Gin 

ATC AGC AGT GTG 

lie Ser Ser Val 
340 

GGG GCC CCG CAG 

Gly Ala Pro Gin 
355 

ATC TTG CCT TCT 

lie Leu Pro Ser 
370 

TCC ACA GAC ACC 

Ser Thr Asp Thr 
385 

CAA AGC CAC GTG 
Gin Ser His Val 

AAG GAG GTC CCT 

Lys Glu Val Pro 
420 

CTG AGG GAC TCC 

Leu Arg Asp Ser 
435 

GTG TTT CGA TGT 

Val Phe Arg Cys 
450 

TTC ACC ATC CAC 

Phe Thr lie His 
465 

AAC ATG TGT GGC 



- 103- 

AAC GCC ATC AGC 

Asn Ala lie Ser 
310 

ACT CCG CCT GCT 

Thr Pro Pro Ala 
325 

TAC CCC ATA GCA 
Tyr Pro lie Ala 

GAG ATG GAA AAG 

Glu Met Glu Lys 
360 

GAA CGA GGT CTG 

Glu Arg Gly Leu 
375 

GAC AGC AAC CAC 

Asp Ser Asn His 
390 

GTC CTC CCC CAG 

Val Leu Pro Gin 
405 

CGC TCT TTT GAA 
Arg Ser Phe Glu 

ATC* AAA GTG ATC 

lie Lys Val lie 
440 

GAC CAC TGC CAC 

Asp His Cys His 
455 

ATG GGG TGC CAT 

Met Gly Cys His 
470 

TAT CGA AGC CAC 



TAT CTA GGG GCT 

Tyr Leu Gly Ala 
315 

CCC ACC TCT GAG 

Pro Thr Ser Glu 
330 

CTT ACT CGG GCC 

Leu Thr Arg Ala 
345 

AAA CGG ATC CTC 

Lys Arg lie Leu 

TCC CCC AAT AAC 

Ser Pro Asn Asn 
380 

GAG GAT CGC CAA 

Glu Asp Arg Gin 
395 

GCC CGC AAT GGG 

Ala -Arg Asn Gly 
410 

CTC CTC AAG CCC 

Leu Leu Lys Pro 
425 

AAC AAA GAA GGG 
Asn Lys Glu Gly 

GTC CTC TTC CTA 

Val Leu Phe Leu 
460 

GGT TTC CGT GAT 

Gly Phe Arg Asp 
475 

GAT CGC TAT GAG 
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Pro Phe Glu Cys Asn Met Cys Gly Tyr Arg Ser His Asp Arg Tyr Glu 
480 485 490 

TTC TCC TCT CAC ATC GCC AGA GGA GAG CAC AGA GCC ATG TTG AAG T 
1895 

Phe Ser Ser His lie Ala Arg Gly Glu His Arg Ala Met Leu Lys 
495 500 505 

GAGCATCTGT CCTCAATGCG AGGGTCAACA TTGTTTTTTA AAGCTGATGG TAGCCTTATC 
1955 

CAGTAGACTG AACTCAAACC CACCTCGAG 
1984 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Glu Asp lie Gin Pro Thr Val Glu Leu Lys Ser Thr Glu Glu Gin 
15 10 15 

Pro Leu Pro Thr Glu Ser Pro Asp Ala Leu Asn Asp Tyr Ser Leu Pro 
20 25 30 

Lys Pro His Glu lie Glu Asn Val Asp Ser Arg Glu Ala Pro Ala Asn 

" ~35 40 45 

Glu Asp Glu Asp Ala Gly Glu Asp Ser Met Lys Val Lys Asp Glu Tyr 
50 55 60 

Ser Asp Arg Asp Glu Asn lie Met Lys Pro Glu Pro Met Gly Asp Ala 
65 70 75 80 

Glu Glu Ser Glu Met Pro Tyr Ser Tyr Ala Arg Glu Tyr Ser Asp Tyr 
85 90 95 

Glu Ser lie Lys Leu Glu Arg His Val Pro Tyr Asp Asn Ser Arg Pro 
100 .105 110 

Thr Ser Gly Lys Met Asn Cys Asp Val Cys Gly Leu Ser Cys lie Ser 
115 120 125 

Phe Asn Val Leu Met Val His Lys Arg Ser His Thr Gly Glu Arg Pro 
130 135 140 

Phe Gin Cys Asn Gin Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu 
145 150 155 160 



Leu Arg 



His lie Lys Leu His Thr Gly Glu Lys Pro Phe Lys Cys 
165 170 175 



His 
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Leu Cys Asn Tyr Ala Cys Gin Arg Arg Asp Ala Leu Thr Gly His Leu 
180 185 190 

Arg Thr His Ser Val Glu Lys Pro Tyr Lys Cys Glu Phe Cys Gly Arg 
195 200 205 

Ser Tyr Lys Gin Arg Ser Ser Leu Glu Glu His Lys Glu Arg Cys Arg 
210 215 220 

Ala Phe Leu Gin Asn Pro Asp Leu Gly Asp Ala Ala Ser Val Glu Ala 
225 230 235 240 

Arg His lie Lys Ala Glu Met Gly Ser Glu Arg Ala Leu Val Leu Asp 
245 250 255 

Arg Leu Ala Ser Asn Val Ala Lys Arg Lys Ser Ser Met Pro Gin Lys 
260 265 270 

Phe lie Gly Glu Lys Arg His Cys Phe Asp Ala Asn Tyr Asn Pro Gly 
275 280 285 

Tyr Met Tyr Glu Lys Glu Asn Glu Met Met Gin Thr Arg Met Met Asp 
290 295 . 300 

Gin Ala lie Asn Asn Ala lie Ser Tyr Leu Gly Ala Glu Ala Phe Arg 
305 310 315 320 

Pro Leu Val Gin Thr Pro Pro Ala Pro Thr Ser Glu Met Val Pro Val 
325 330 335 

lie Ser Ser Val Tyr Pro lie Ala Leu Thr Arg Ala Asp Met Pro Met 
340 ~ 345 350 

Gly Ala Pro Gin Glu Met Glu Lys Lys Arg lie Leu Leu Pro Glu Lys 
355 360 365 

lie Leu Pro Ser Glu Arg Gly Leu Ser Pro Asn Asn Ser Ala Gin Asp 
370 375 380 

Ser Thr Asp Thr Asp Ser Asn His Glu Asp Arg Gin His Leu Tyr Gin 
385 . 390 395 400 

Gin Ser His Val Val Leu Pro Gin Ala Arg Asn Gly Met Pro Leu Leu 
405 410 415 

Lys Glu Val Pro Arg Ser Phe Glu Leu Leu Lys Pro Pro Pro lie Cys 
420 425 430 

Leu Arg Asp Ser lie Lys Val lie Asn Lys Glu Gly Glu Val Met Asp 
435 440 445 

Val Phe Arg Cys Asp His Cys His Val Leu Phe Leu Asp Tyr Val Met 
450 455 460 



Phe Thr lie His Met Gly Cys His Gly Phe Arg Asp Pro Phe Glu Cys 
465 470 475 480 
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Asn Met Cys Gly Tyr Arg Ser His Asp Arg Tyr Glu Phe Ser Ser His 
485 490 495 

5 lie Ala Arg Gly Glu His Arg Ala Met Leu Lys 
500 505 

(2) INFORMATION FOR SEQ ID NO : 3 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: cDNA 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 
TACTACCATC TCACATGGGC TGACCA 

^ 25 (2) INFORMATION FOR SEQ ID NO : 4 : 

4f (i) SEQUENCE CHARACTERISTICS: 

+j (A) LENGTH: 26 base pairs 

y=? (B) TYPE: nucleic acid 

= 30 (C) STRANDEDNESS: single 

O (D) TOPOLOGY: linear 

ru 



S3 



35 



(ii) MOLECULE TYPE:- cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



GACCAGCACA TGTTGACACT CTGAAA 
40 26 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GTGTGCGGGT TATCCTGCAT TAGC 
24 
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(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

ATCGAAG C AG TGCCGCTTCT CACC 
24 

(2) INFORMATION FOR SEQ ID NO : 7 : 
(2) INFORMATION FOR SEQ ID NO : 8 : 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

GTAACCTCCT CCGTCATATT AAAC 
24 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

CGAGCTTTTC TTCAGAACCC TGAC 
24 

(2) INFORMATION FOR SEQ ID NO: 11: 
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10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



TCAGCTTTTG GGAATACCCT GTCA 
15 24 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: 24 base pairs 

_ (B) TYPE: nucleic acid 

W (C) STRANDEDNESS: single 

vO (D) TOPOLOGY: linear 
y 

H= 25 (ii) MOLECULE TYPE: cDNA 

yp 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TCAGCTTTTG GGGGTACCCT GTCA 
24 

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 0 base pairs 
. (B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

ATGGTGAAGG TCGGTGTGAA CGGATTTGGC 
30 

50 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 0 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 

GCATCGAAGG TGGAAGAGTG GGAGTTGCTG 
30 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: cDNA 



^ (ix) FEATURE: 



(A) NAME /KEY : CDS 



UP (B) LOCATION: 223.. 1515 

U 25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



05 AATTCGTTCT ACCTTCTCTG AACCCCAGTG GTGTGTCAAG GCCGGACTGG GAGCTTGGGG 60 

30 



GAAGAGGAAG AGGAAGAGGA ATCTGCGGCT CATCCAGGGA TCAGGGTCCT TCCCAAGTGG 12 0 
CCACTCAGAG GGGACTCAGA GCAAGTCTAG ATTTGTGTGG CAGAGAGAGA CAGCTCTCGT 18 0 



w 35 TTGGCCTTGG GGAGGCACAA GTCTGTTGAT AACCTGAAGA CA 222 

^ ATG GAT GTC GAT GAG GGT CAA GAC ATG TCC CAA GTT TCA GGA AAG GAG 2 70 

Met Asp Val Asp Glu Gly Gin Asp Met Ser Gin Val Ser Gly Lys Glu 
15 10 15 

40 

AGC CCC CCA GTC AGT GAC ACT CCA GAT GAA GGG GAT GAG CCC ATG CCT 318 
Ser Pro Pro Val Ser Asp Thr Pro Asp Glu Gly Asp Glu Pro Met Pro 
20 25 30 

45 GTC CCT GAG GAC CTG TCC ACT ACC TCT GGA GCA CAG CAG AAC TCC AAG 366 
Val Pro Glu Asp Leu Ser Thr Thr Ser Gly Ala Gin Gin Asn Ser Lys 
35 40 45 

AGT GAT CGA GGC ATG GGT GAA CGG CCT TTC CAG TGC AAC CAG TCT GGG 414 
50 Ser Asp Arg Gly Met Gly Gin Arg Pro Phe Gin Cys Asn Gin Ser Gly 
50 55 60 

GCC TCC TTT ACC CAG AAA GGC AAC CTC CTG CGG CAC ATC AAG CTG CAC 462 
Ala Ser Phe Thr Gin Lys Gly Asn Leu Leu Arg His lie Lys Leu His 
55 65 • 70 75 80 



TCG GGT GAG AAG CCC TTC AAA TGC CAT CTT .TGC AAC TAT GCC TGC CGC 



510 
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Ser Gly Glu Lys Pro Phe Lys Cys His Leu Cys Asn Tyr Ala Cys Arg 
85 90 95 

CGG AGG GAC GCC CTC ACC GGC CAC CTG AGG ACG CAC TCC GTT GGT AAG 
Arg Arg Asp Ala Leu Thr Gly His Leu Arg Thr His Ser Val Gly Lys 
100 105 110 

CCT CAC AAA TGT GGA TAT TGT GGC CGG AGC TAT AAA CAG CGA AGC TCT 
Pro His Lys Cys Gly Tyr Cys Gly Arg Ser Tyr Lys Gin Arg Ser Ser 
115 120 125 

TTA GAG GAG CAT AAA GAG CGA TGC CAC AAC TAC TTG GAA AGC ATG GGC 
Leu Glu Glu His Lys Glu Arg Cys His Asn Tyr Leu Glu Ser Met Gly 
130 135 140 

CTT CCG GGC GTG TGC CCA GTC ATT AAG GAA GAA ACT AAC CAC AAC GAG 
Leu Pro Gly Val Cys Pro Val lie Lys Glu Glu Thr Asn His Asn Glu 
145 150 155 160 

ATG GCA GAA GAC CTG TGC AAG ATA GGA GCA GAG AGG TCC CTT GTC CTG 
Met Ala Glu Asp Leu Cys Lys lie Gly Ala Glu Arg Ser Leu Val Leu 
165 170 175 

GAC AGG CTG GCA AGC AAT GTC GCC AAA CGT AAG AGC TCT ATG CCT CAG 
Asp Arg Leu Ala Ser Asn Val Ala Lys Arg Lys Ser Ser Met Pro Gin 
180 185 190 

AAA TTT CTT GGA GAC AAG TGC CTG TCA GAC ATG CCC TAT GAC AGT GCC 
Lys Phe Leu Gly Asp Lys Cys Leu Ser Asp Met Pro Tyr Asp Ser Ala 
195 200 205 

AAC TAT GAG AAG GAG GAT ATG ATG ACA TCC CAC GTG ATG GAC CAG GCC 
Asn Tyr Glu Lys Glu Asp Met Met Thr Ser His Val Met Asp Gin Ala 
210 215 220 

ATC AAC AAT GCC ATC AAC TAC CTG GGG GCT GAG TCC CTG CGC CCA TTG 
lie Asn Asn Ala lie Asn Tyr Leu Gly Ala Glu Ser Leu Arg Pro Leu 
225 230 235 240 

GTG CAG ACA CCC CCC GGT AGC TCC GAG GTG GTG CCA GTC ATC AGC TCC 
Val Gin Thr Pro Pro Gly Ser Ser Glu Val Val Pro Val lie Ser Ser 
245 250 255 

ATG TAC CAG CTG CAC AAG CCC CCC TCA GAT GGC CCC CCA CGG TCC AAC 
Met Tyr Gin Leu His Lys Pro Pro Ser Asp Gly Pro Pro Arg Ser Asn 
260 265 270 

CAT TCA GCA CAG GAC GCC GTG GAT AAC TTG CTG CTG CTG TCC AAG GCC 
His Ser Ala Gin Asp Ala Val Asp Asn Leu Leu Leu Leu Ser Lys Ala 
275 1 280 285 

AAG TCT GTG TCA TCG GAG CGA GAG GCC TCC CCG AGC AAC AGC TGC CAA 
Lys Ser Val Ser Ser Glu Arg Glu Ala Ser Pro Ser Asn Ser Cys Gin 
290 295 300 



GAC TCC ACA GAT ACA GAG AGC AAC GCG GAG GAA CAG CGC AGC GGC CTT 
Asp Ser Thr Asp Thr Glu Ser Asn Ala Glu Glu Gin Arg Ser Gly Leu 
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o 

Ln 



10 



305 310 315 320 

ATC TAG CTA ACC AAC CAC ATC AAC CCG CAT GCA CGC AAT GGG CTG GCT 12 3 0 

lie Tyr Leu Thr Asn His lie Asn Pro His Ala Arg Asn Gly Leu Ala 
325 330 335 

CTC AAG GAG GAG CAG CGC GCC TAC GAG GTG CTG AGG GCG GCC TCA GAG 12 7 8 

Leu Lys Glu Glu Gin Arg Ala Tyr Glu Val Leu Arg Ala Ala Ser Glu 
340 345 350 

AAC TCG CAG GAT GCC TTC CGT GTG GTC AGC ACG AGT GGC GAG CAG CTG 132 6 

Asn Ser Gin Asp Ala Phe Arg Val Val Ser Thr Ser Gly Glu Gin Leu 
355 360 365 

15 AAG GTG TAC AAG TGC GAA CAC TGC CGC GTG CTC TTC CTG GAT CAC GTC 13 74 

Lys Val Tyr Lys Cys Glu His Cys Arg Val Leu Phe Leu Asp His Val 
370 375 380 

ATG TAT ACC ATT CAC ATG GGC TGC CAT GGC TGC CAT GGC TTT CGG GAT 142 2 

20 Met Tyr Thr lie His Met Gly Cys His Gly Cys His Gly Phe Arg Asp 
385 390 395 400 

CCC TTT GAG TGT AAC ATG TGT GGT TAT CAC AGC CAG GAC AGG TAC GAG 14 7 0 

Pro Phe Glu Cys Asn Met Cys Gly Tyr His Ser Gin Asp Arg Tyr Glu 
25 405 410 415 

TTC TCA TCC CAT ATC ACG CGG GGG GAG CAT CGT TAC CAC CTG AGC 1515 
Phe Ser Ser His lie Thr Arg Gly Glu His Arg Tyr His Leu Ser 
420 425 430 

30 

TAAACCCAGC CAGGCCCCAC TGAAGCACAA AGATAGCTGG TTATGCCTCC TTCCCGGCAG 1575 
CTGGACCCAC AGCGGACAAT GTGGGAGTGG ATTTGCAGGC AGCATTTGTT CTTTTATGTT 16 3 5 
35 GGTTGTTTGG CGTTTCATTT GCGTTGGAAG ATAAGTTTTT AATGTTAGTG ACAGGATTGC 16 95 
ATTGCATCAG CAACATTCAC AAC ATC CATC CTTCTAGCCA GTTTTGTTCA CTGGTAGCTG 17 5 5 
AGGTTTCCCG GATATGTGGC TTCCTAACAC TCT 178 8 

(2) INFORMATION FOR SEQ ID NO : 16 : 



40 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 86 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



50 



55 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1386 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



AAT GTT AAA GTA 
48 

Asn Val Lys Val 
1 

GAA ATG AAT GGG 
96 

Glu Met Asn Gly 
20 

TCG GGA GAG AAA 
144 

Ser Gly Glu Lys 
35 

TTG TCG GGA GTT 
192 

Leu Ser Gly Val 
50 

GAT ATC TGT GGG 
240 

Asp lie Cys Gly 
65 

AAA AGA AGC CAC 
288 

Lys Arg Ser His 



GCC TCA TTC ACC 
336 

Ala Ser Phe Thr 
100 

TCC GGG GAG AAG 
384 

Ser Gly Glu Lys 
115 

CGG AGG GAC GCC 
432 

Arg Arg Asp Ala 
130 

CCT CAC AAA TGT 
480 

Pro His Lys Cys 
145 

TTA GAG GAA CAT 
528 

Leu Glu Glu His 



GAG ACT CAG AGT 

Glu Thr Gin Ser 
5 

GAA GAA TGT GCG 
Glu Glu Cys Ala 

ATG AAT GGC TCC 

Met Asn Gly Ser 
40 

GGA GGC ATT CGA 

Gly Gly lie Arg 
55 

ATC ATT TGC ATC 

lie lie Cys lie 
70 

ACT GGA GAA CGG 

Thr Gly Glu Arg 
85 

CAG AAG GGC AAC 
Gin Lys Gly Asn 

CCC TTC AAA TGC 

Pro Phe Lys Cys 
120 

CTC ACT GGC CAC 

Leu Thr Gly His 
135 

GGA TAT TGT GGC 

Gly Tyr Cys Gly 
150 

AAA GAG CGC TGC 

Lys Glu Arg Cys 
165 
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GAT GAA GAG AAT 

Asp Glu Glu Asn 
10 

GAG GAT TTA CGA 

Glu Asp Leu Arg 
25 

CAC AGG GAC CAA 
His Arg Asp Gin 

CTT CCT AAC GGA 

Leu Pro Asn Gly 
60 

GGG CCC AAT GTG 

Gly Pro Asn Val 
75 

CCC TTC CAG TGC 

Pro Phe Gin Cys 
90 

CTG CTC CGG CAC 

Leu Leu Arg His 
105 

CAC CTC TGC AAC 
His Leu Cys Asn 

CTG AGG ACG CAC 

Leu Arg Thr His 
140 

CGA AGC TAT AAA 

Arg Ser Tyr Lys 
155 

CAC AAC TAC TTG 

His Asn Tyr Leu 
170 



GGG CGT GCC TGT 

Gly Arg Ala Cys 
15 

ATG CTT GAT GCC 

Met Leu Asp Ala 
30 

GGC AGC TCG GCT 

Gly Ser Ser Ala 
45 

AAA CTA AAG TGT 
Lys Leu Lys Cys 

CTC ATG GTT CAC 

Leu Met Val His 
80 

AAT CAG TGC GGG 

Asn Gin Cys Gly 
95 

ATC AAG CTG CAT 

lie Lys Leu His 
110 

TAC GCC TGC CGC 

Tyr Ala Cys Arg 
125 

TCC GTT GGT AAA 
Ser Val Gly Lys 

CAG CGA ACG TCT 

Gin Arg Thr Ser 
160 

GAA AGC ATG GGC 

Glu Ser Met Gly 
175 



CTT CCG GGC ACA 
576 

Leu Pro Gly Thr 
180 

GAA ATG GCA GAA 
624 

Glu Met Ala Glu 
195 

CTG GAC AGA CTA 
672 

Leu Asp Arg Leu 
210 

CAG AAA TTT CTT 
720 

Gin Lys Phe Leu 

225 

GCC ACG TAC GAG 
768 

Ala Thr Tyr Glu 



CAA GCC ATC AAC 
816 

Gin Ala lie Asn 
260 

CCG CTG GTG CAG 
864 

Pro Leu Val Gin 
275 

AGC CCG ATG TAC 
912 

Ser Pro Met Tyr 
290 

AAC CAC TCG GCC 
960 

Asn His Ser Ala 
305 

AAG GCC AAG TTG 
1008 

Lys Ala Lys Leu 



TGC CAA GAC TCC 
1056 

Cys Gin Asp Ser 
340 

GGT CTT ATC TAC 
1104 



CTG TAC CCA GTC 
Leu Tyr Pro Val 

GAC CTG TGC AAG 

Asp Leu Cys Lys 
200 

GCA AGT AAT GTC 

Ala Ser Asn Val 
215 

GGG GAC AAG GGC 

Gly Asp Lys Gly 
230 

AAG GAG AAC GAA 

Lys Glu Asn Glu 
245 

AAC GCC ATC AAC 
Asn Ala lie Asn 

ACG CCC CCG GGC 

Thr Pro Pro Gly 
280 

CAG CTG CAC AGG 

Gin Leu His Arg 
295 

CAG GAC AGC GCC 

Gin Asp Ser Ala 
310 

GTG CCC TCG GAG 

Val Pro Ser Glu 
325 

ACG GAC ACC GAG 
Thr Asp Thr Glu 

CTG ACC AAC CAC 



- 113- 

ATT AAA GAA GAA 

lie Lys Glu Glu 
185 

ATA GGA TCA GAG 
lie Gly Ser Glu 

GCC AAA CGT AAG 

Ala Lys Arg Lys 
220 

CTG TCC GAC ACG 

Leu Ser Asp Thr 
235 

ATG ATG AAG TCC 

Met Met Lys Ser 
250 

TAC CTG GGG GCC 

Tyr Leu Gly Ala 
265 

GGT TCC GAG GTG 
Gly Ser Glu Val 

CGC TCG GAG GGC 

Arg Ser Glu Gly 
300 

GTG GAG TAC CTG 

Val Glu Tyr Leu 
315 

CGC GAG GCG TCC 

Arg Glu Ala Ser 
330 

AGC AAC AAC GAG 

Ser Asn Asn Glu 
345 

ATC GCC CGA CGC 



ACT AAG CAC AGT 

Thr Lys His Ser 
190 

AGA TCT CTC GTG 

Arg Ser Leu Val 
205 

AGC TCT ATG CCT 
Ser Ser Met Pro 

CCC TAC GAC AGT 

Pro Tyr Asp Ser 
240 

CAC GTG ATG GAC 

His Val Met Asp 
255 

GAG TCC CTG CGC 

Glu Ser Leu Arg 
270 

GTC CCG GTC ATC 

Val Pro Val lie 
285 

ACC CCG CGC TCC 
Thr Pro Arg Ser 

CTG CTG CTC TCC 

Leu Leu Leu Ser 
320 

CCG AGC AAC AGC 

Pro Ser Asn Ser 
335 

GAG CAG CGC AGC 

Glu Gin Arg Ser 
350 

GCG CAA CGC GTG 
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Gly Leu lie Tyr Leu Thr Asn His lie Ala Arg Arg Ala Gin Arg Val 
355 360 365 



TCG CTC AAG GAG GAG CAC CGC GCC TAC GAC CTG CTG CGC GCC GCC TCC 
1152 

Ser Leu Lys Glu Glu His Arg Ala Tyr Asp Leu Leu Arg Ala Ala Ser 

370 375 380 



GAG AAC TCG CAG GAC GCG CTC CGC GTG GTC AGC ACC AGC GGG GAG CAG 
1200 

Glu Asn Ser Gin Asp Ala Leu Arg Val Val Ser Thr Ser Gly Glu Gin 
385 390 395 400 



ATG AAG GTG TAC AAG TGC GAA CAC TGC CGG GTG CTC TTC CTG GAT CAC 
1248 

Met Lys Val Tyr Lys Cys Glu His Cys Arg Val Leu Phe Leu Asp His 
405 410 415 



GTC ATG TAC ACC ATC CAC ATG GGC TGC CAC GGC TTC CGT GAT CCT TTT 
1296 

Val Met Tyr Thr lie His Met Gly Cys His Gly Phe Arg Asp Pro Phe 
420 425 430 



GAG TGC AAC ATG TGC GGC TAC CAC AGC CAG GAC CGG TAC GAG TTC TCG 
1344 

Glu Cys Asn Met Cys Gly Tyr His Ser Gin Asp Arg Tyr Glu Phe Ser 
435 440 445 



TCG CAC ATA ACG CGA GGG GAG CAC CGC TTC CAC ATG AGC TAA 
1386 

Ser His lie Thr Arg Gly Glu His Arg Phe His Met Ser 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1296 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG GAT GTC GAT GAG GGT CAA GAC ATG TCC CAA GTT TCA GGA AAG GAG 
48 

Met Asp Val Asp Glu Gly Gin Asp Met Ser Gin Val Ser Gly Lys Glu 
15 10 15 



AGC CCC CCA GTC 
96 

Ser Pro Pro Val 
20 

GTC CCT GAG GAC 
144 

Val Pro Glu Asp 
35 

AGT GAT CGA GGC 
192 

Ser Asp Arg Gly 
50 

GAA GAG AAT GGG 
240 

Glu Glu Asn Gly 
65 

GAT TTA CGA ATG 
288 

Asp Leu Arg Met 



AGG GAC CAA GGC 
336 

Arg Asp Gin Gly 
100 

CCT AAC GGA AAA 
384 

Pro Asn Gly Lys 
115 

CCC AAT GTG CTC 
432 

Pro Asn Val Leu 
130 

TTC CAG TGC AAC 
480 

Phe Gin Cys Asn 
145 

CTG CGG CAC ATC 
528 

Leu Arg His lie 



CTT TGC AAC TAT 
576 

Leu Cys Asn Tyr 
180 

AGG ACG CAC TCC 
624 



AGT GAC ACT CCA 
Ser Asp Thr Pro 

CTG TCC ACT ACC 

Leu Ser Thr Thr 
40 

ATG GCC AGT AAT 

Met Ala Ser Asn 
55 

CGT GCC TGT GAA 

Arg Ala Cys Glu 
70 

CTT GAT GCC TCG 

Leu Asp Ala Ser 
85 

AGC TCG GCT TTG 
Ser Ser Ala Leu 

CTA AAG TGT GAT 

Leu Lys Cys Asp 
120 

ATG GTT CAC AAA 

Met Val His Lys 
135 

CAG TCT GGG GCC 

Gin Ser Gly Ala 
150 

AAG CTG CAC TCG 

Lys Leu His Ser 
165 

GCC TGC CGC CGG 
Ala Cys Arg Arg 

GGA GAC AAG TGC 



- 115- 

GAT GAA GGG GAT 

Asp Glu Gly Asp 
25 

TCT GGA GCA CAG 
Ser Gly Ala Gin 

GTT AAA GTA GAG 

Val Lys Val Glu 
60 

ATG AAT GGG GAA 

Met Asn Gly Glu 
75 

GGA GAG AAA ATG 

Gly Glu Lys Met 
90 

TCA GGA GTT GGA 

Ser Gly Val Gly 
105 

ATC TGT GGG ATC 
lie Cys Gly lie 

AGA AGT CAT ACT 

Arg Ser His Thr 
140 

TCC TTT ACC CAG 

Ser Phe Thr Gin 
155 

GGT GAG AAG CCC 

Gly Glu Lys Pro 
170 

AGG GAC GCC CTC 

Arg Asp Ala Leu 
185 

CTG TCA GAC ATG 



GAG CCC ATG CCT 

Glu Pro Met Pro 
30 

CAG AAC TCC AAG 

Gin Asn Ser Lys 
45 

ACT CAG AGT GAT 
Thr Gin Ser Asp 

GAA TGT GCA GAG 

Glu Cys Ala Glu 
80 

AAT GGC TCC CAC 

Asn Gly Ser His 
95 

GGC ATT CGA CTT 

Gly lie Arg Leu 
110 

GTT TGC ATC GGG 

Val Cys lie Gly 
12 5 

GGT GAA CGG CCT 
Gly Glu Arg Pro 

AAA GGC AAC CTC 

Lys Gly Asn Leu 
160 

TTC AAA TGC CAT 

Phe Lys Cys His 
175 

ACC GGC CAC CTG 

Thr Gly His Leu 
190 

CCC TAT GAC AGT 



Arg Thr His Ser Gly Asp 
195 

GCC AAC TAT GAG AAG GAG 
672 

Ala Asn Tyr Glu Lys Glu 
210 

GCC ATC AAC AAT GCC ATC 
720 

Ala lie Asn Asn Ala lie 
225 230 

TTG GTG CAG ACA CCC CCC 
768 

Leu Val Gin Thr Pro Pro 
245 

TCC ATG TAC CAG CTG CAC 
816 

Ser Met Tyr Gin Leu His 
260 

AAC CAT TCA GCA CAG GAC 
864 

Asn His Ser Ala Gin Asp 
275 

GCC AAG TCT GTG TCA TCG 

912 

Ala Lys Ser Val Ser Ser 
290 

CAA GAC TCC ACA GAT ACA 
960 

Gin Asp Ser Thr Asp Thr 
305 310 

CTT ATC TAC CTA ACC AAC 
1008 

Leu lie Tyr Leu Thr Asn 
325 

GCT CTC AAG GAG GAG CAG 
1056 

Ala Leu Lys Glu Glu Gin 
340 

GAG AAC TCG CAG GAT GCC 
1104 

Glu Asn Ser Gin Asp Ala 
355 

CTG AAG GTG TAC AAG TGC 
1152 

Leu Lys Val Tyr Lys Cys 
370 



- 116- 

Lys Cys Leu Ser Asp 
200 

GAT ATG ATG ACA TCC 

Asp Met Met Thr Ser 
215 

AAC TAC CTG GGG GCT 

Asn Tyr Leu Gly Ala 
235 

GGT AGC TCC GAG GTG 

Gly Ser Ser Glu Val 
2S0 r 

AAG CCC CCC TCA GAT 

Lys Pro Pro Ser Asp 
265 

GCC GTG GAT AAC TTG 

Ala Val Asp Asn Leu 
280 

GAG CGA GAG GCC TCC 

Glu Arg Glu Ala Ser 
295 

GAG AGC AAC GCG GAG 

Glu Ser Asn Ala Glu 
315 

CAC ATC AAC CCG CAT 

His lie Asn Pro His 
330 

CGC GCC TAC GAG GTG 

Arg Ala Tyr Glu Val 

.345 

TTC CGT GTG GTC AGC 

Phe Arg Val Val Ser 
360 

GAA CAC TGC CGC GTG 

Glu His Cys Arg Val 
375 



Met Pro Tyr Asp Ser 
205 

CAC GTG ATG GAC CAG 

His Val Met Asp Gin 
220 

GAG TCC CTG CGC CCA 

Glu Ser Leu Arg Pro 
240 

GTG CCA GTC ATC AGC 

Val Pro Val lie Ser 
255 

GGC CCC CCA CGG TCC 

Gly Pro Pro Arg Ser 
270 

CTG CTG CTG TCC AAG 

Leu Leu Leu Ser Lys 
285 

CCG AGC AAC AGC TGC 

Pro Ser Asn Ser Cys 
300 

GAA CAG CGC AGC GGC 

Glu Gin Arg Ser Gly 
320 

GCA CGC AAT GGG CTG 

Ala Arg Asn Gly Leu 
335 

CTG AGG GCG GCC TCA 

Leu Arg Ala Ala Ser 
350 

ACG AGT GGC GAG CAG 

Thr Ser Gly Glu Gin 
365 

CTC TTC CTG GAT CAC 

Leu Phe Leu Asp His 
380 



- 117- 



GTC ATG TAT ACC ATT CAC ATG GGC TGC CAT GGC TGC CAT GGC TTT CGG 
1200 

Val Met Tyr Thr lie His Met Gly Cys His Gly Cys His Gly Phe Arg 
385 390 395 400 

GAT CCC TTT GAG TGT AAC ATG TGT GGT TAT CAC AGC CAG GAC AGG TAC 
1248 

Asp Pro Phe Glu Cys Asn Met Cys Gly Tyr His Ser Gin Asp Arg Tyr 
405 . 410 415 

GAG TTC TCA TCC CAT ATC ACG CGG GGG GAG CAT CGT TAC CAC CTG AGC 
1296 

Glu Phe Ser Ser His lie Thr Arg Gly Glu His Arg Tyr His Leu Ser 
420 425 430 

(2) INFORMATION FOR SEQ ID NO : 18 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 223.. 1776 



(xi) SEQUENCE DESCRIPTION-: SEQ ID NO: 18: 

AATTCGTTCT ACCTTCTCTG AACCCCAGTG GTGTGTCAAG GCCGGACTGG GAGCTTGGGG 
60 

GAAGAGGAAG AGGAAGAGGA ATCTGCGGCT CATCCAGGGA TCAGGGTCCT TCCCAAGTGG 
120 

CCACTCAGAG GGGAC TC AG A GCAAGTCTAG ATTTGTGTGG CAGAGAGAGA CAGCTCTCGT 
180 

TTGGCCTTGG GG AGG C AC AA GTCTGTTGAT AACCTGAAGA CA ATG GAT GTC GAT 
234 

Met Asp Val Asp 
1 

GAG GGT CAA GAC ATG TCC CAA GTT TCA GGA AAG GAG AGC CCC CCA GTC 
282 

Glu Gly Gin Asp Met- Ser Gin Val Ser Gly Lys Glu Ser Pro Pro Val 
5 10 15 20 

AGT GAC ACT CCA GAT GAA GGG GAT GAG CCC ATG CCT GTC CCT GAG GAC 
330 

Ser Asp Thr Pro Asp Glu Gly Asp Glu Pro Met Pro Val Pro Glu Asp 
25 30 35 



118- 



CTG TCC ACT ACC TCT GGA GCA CAG CAG AAC TCC AAG AGT GAT CGA GGC 
378 

Leu Ser Thr Thr Ser Gly Ala Gin Gin Asn Ser Lys Ser Asp Arg Gly 
40 45 50 

ATG GCC AGT AAT GTT AAA GTA GAG ACT CAG AGT GAT GAA GAG AAT GGG 
426 

Met Ala Ser Asn Val Lys Val Glu Thr Gin Ser Asp Glu Glu Asn Gly 
55 60 65 

CGT GCC TGT GAA ATG AAT GGG GAA GAA TGT GCA GAG GAT TTA CGA ATG 
474 

Arg Ala Cys Glu Met Asn Gly Glu Glu Cys Ala Glu Asp Leu Arg Met 
70 75 80 

CTT GAT GCC TCG GGA GAG AAA ATG AAT GGC TCC CAC AGG GAC CAA GGC 
522 

Leu Asp Ala Ser Gly Glu Lys Met Asn Gly Ser His Arg Asp Gin Gly 

85 90 95 100 

AGC TCG GCT TTG TCA GGA GTT GGA GGC ATT CGA CTT CCT AAC GGA AAA 
570 

Ser Ser Ala Leu Ser Gly Val Gly Gly lie Arg Leu Pro Asn Gly Lys 
105 110 115 

CTA AAG TGT GAT ATC TGT GGG ATC GTT TGC ATC GGG CCC AAT GTG CTC 
618 

Leu Lys Cys Asp lie Cys Gly lie Val Cys lie Gly Pro Asn Val Leu 
120 125 130 

ATG GTT CAC AAA AGA AGT CAT ACT GGT GAA CGG CCT TTC CAG TGC AAC 
666 _ 

Met Val His Lys Arg Ser His Thr Gly Glu Arg Pro Phe Gin Cys Asn 
135 140 145 

CAG TCT GGG GCC TCC TTT ACC CAG AAA GGC AAC CTC CTG CGG CAC ATC 
714 

Gin Ser Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu Leu Arg His lie 
150 155 160 

AAG CTG CAC TCG GGT GAG AAG CCC TTC AAA TGC CAT CTT TGC AAC TAT 
762 

Lys Leu His Ser Gly Glu Lys Pro Phe Lys Cys His Leu Cys Asn Tyr 
165 170 175 180 

GCC TGC CGC CGG AGG GAC GCC CTC ACC GGC CAC CTG AGG ACG CAC TCC 
810 

Ala Cys Arg Arg Arg Asp Ala Leu Thr Gly His Leu Arg Thr His Ser 
185 190 195 

GTT GGT. AAG CCT CAC AAA TGT GGA TAT TGT GGC CGG AGC TAT AAA CAG 
858 

Val Gly Lys Pro His Lys Cys Gly Tyr Cys Gly Arg Ser Tyr Lys Gin 
200 205 210 



CGA AGC TCT TTA GAG 
906 

Arg Ser Ser Leu Glu 
215 

AGC ATG GGC CTT CCG 
954 

Ser Met Gly Leu Pro 
230 

CAC AAC GAG ATG GCA 
1002 

His Asn Glu Met Ala 
245 

CTT GTC CTG GAC AGG 
1050 

Leu Val Leu Asp Arg 
265 

ATG CCT CAG AAA TTT 
1098 

Met Pro Gin Lys Phe 
280 

GAC AGT GCC AAC TAT 
1146 

Asp Ser Ala Asn Tyr 
295 

GAC CAG GCC ATC AAC 
1194 

Asp Gin Ala He Asn 
310 

CGC CCA TTG GTG CAG 
1242 

Arg Pro Leu Val Gin 
325 

ATC AGC TCC ATG TAC 
1290 

He Ser Ser Met Tyr 
345 

CGG TCC AAC CAT TCA 
1338 

Arg Ser Asn His . Ser 
360 

TCC AAG GCC AAG TCT 
1386 

Ser Lys Ala Lys Ser 
375 

AGC TGC CAA GAC TCC 
1434 
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GAG CAT AAA GAG CGA 

Glu His Lys Glu Arg 
220 

GGC GTG TGC CCA GTC 

Gly Val Cys Pro Val 
235 

GAA GAC CTG TGC AAG 

Glu Asp Leu Cys Lys 
250 

CTG GCA AGC AAT GTC 

Leu Ala Ser Asn Val 
270 

CTT GGA GAC AAG TGC 

Leu Gly Asp Lys Cys 
285 

GAG AAG GAG GAT ATG 

Glu Lys Glu Asp Met 
300 

AAT GCC ATC AAC TAC 

Asn Ala He Asn Tyr 
315 

ACA CCC CCC GGT AGC 

Thr Pro Pro Gly Ser 
330 

CAG CTG CAC AAG CCC 

Gin Leu His Lys Pro 
350 

GCA CAG GAC GCC GTG 

Ala Gin Asp Ala Val 
365 

GTG TCA TCG GAG CGA 

Val Ser Ser Glu Arg 
380 

ACA GAT ACA GAG AGC 



TGC CAC AAC TAC TTG GAA 

Cys His Asn Tyr Leu Glu 
225 

ATT AAG GAA GAA ACT AAC 

He Lys Glu Glu Thr Asn 
240 

ATA GGA GCA GAG AGG TCC 

He Gly Ala Glu Arg Ser 
255 260 

GCC AAA CGT AAG AGC TCT 

Ala Lys Arg Lys Ser Ser 
275 

CTG TCA GAC ATG CCC TAT 

Leu Ser Asp Met Pro Tyr 
290 

ATG ACA TCC CAC GTG ATG 

Met Thr Ser His Val Met 
305 

CTG GGG GCT GAG TCC CTG 

Leu Gly Ala Glu Ser Leu 
320 

TCC GAG GTG GTG CCA GTC 

Ser Glu Val Val Pro Val 
335 340 

CCC TCA GAT GGC CCC CCA 

Pro Ser Asp Gly Pro Pro 
355 

GAT AAC TTG CTG CTG CTG 

Asp Asn Leu Leu Leu Leu 
370 

GAG GCC TCC CCG AGC AAC 

Glu Ala Ser Pro Ser Asn 
385 

AAC GCG GAG GAA CAG CGC 
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Ser Cys Gin Asp Ser Thr Asp Thr Glu Ser Asn Ala Glu Glu Gin Arg 
390 395 400 

AGC GGC CTT ATC TAC CTA ACC AAC CAC ATC AAC. CCG CAT GCA CGC AAT 
1482 

Ser Gly Leu lie Tyr Leu Thr Asn His lie Asn Pro His Ala Arg Asn 

405 410 415 420 

GGG CTG GCT CTC AAG GAG GAG CAG CGC GCC TAC GAG GTG CTG AGG GCG 
1530 

Gly Leu Ala Leu Lys Glu Glu Gin Arg Ala Tyr Glu Val Leu Arg Ala 
425 430 435 

GCC TCA GAG AAC TCG CAG GAT GCC TTC CGT GTG GTC AGC ACG AGT GGC 
1578 

Ala Ser Glu Asn Ser Gin Asp Ala Phe Arg Val Val Ser Thr Ser Gly 
440 445 450 

GAG CAG CTG AAG GTG TAC AAG TGC GAA CAC TGC CGC GTG CTC TTC CTG 
1626 

Glu Gin Leu Lys Val Tyr Lys Cys Glu His Cys Arg Val Leu Phe Leu 

455 460 465 

GAT CAC GTC ATG TAT ACC ATT CAC ATG GGC TGC CAT GGC TGC CAT GGC 
1674 

Asp His Val Met Tyr Thr lie His Met Gly Cys His Gly Cys His Gly 
470 475 480 

TTT CGG GAT CCC TTT GAG TGT AAC ATG TGT GGT TAT CAC AGC CAG GAC 

1722 /■ 

Phe Arg Asp Pro Phe Glu Cys Asn Met Cys Gly Tyr His Ser Gin Asp 

485 490 495 500 

AGG TAC GAG TTC TCA TCC CAT ATC ACG CGG GGG GAG CAT CGT TAC CAC 
1770 

Arg Tyr Glu Phe Ser Ser His lie Thr Arg Gly Glu His Arg Tyr His 
505 510 515 

CTG AGC TAAACCCAGC CAGGCCCCAC TGAAGCACAA AGATAGC TGG TTATGCCTCC 

1826 

Leu Ser 



TTCCCGGCAG CTGGACCCAC AGCGGACAAT GTGGGAGTGG ATTTGCAGGC AGCATTTGTT 
1886 

CTTTTATGTT GGTTGTTTGG CGTTTCATTT GCGTTGGAAG ATAAGTTTTT AATGTTAGTG 
1946 

ACAGGATTGC ATTGCATCAG CAACATTCAC AACATCCATC CTTCTAGCCA GTTTTGTTCA 
2006 

CTGGTAGCTG AGGTTTCCCG GATATGTGGC TTCCTAACAC TCT 
2049 



(2) INFORMATION FOR SEQ ID NO: 19: 



-121- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1170 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



GAG GGT CAA GAC ATG TCC CAA GTT 

Glu Gly Gin Asp Met Ser Gin Val 
5 10 

AGT GAC ACT CCA GAT GAA GGG GAT 

Ser Asp Thr Pro Asp Glu Gly Asp 
25 



ATG GAT GTC GAT 
48 

Met Asp Val Asp 
1 

AGC CCC CCA GTC 
96 

Ser Pro Pro Val 
20 

GTC CCT GAG GAC 
144 

Val Pro Glu Asp 
35 



GCC TCC TTT ACC 
240 

Ala Ser Phe Thr 
65 

TCG GGT GAG AAG 
288 

Ser Gly Glu Lys 



CGG AGG GAC GCC 
336 

Arg Arg Asp Ala 
100 



CTG TCC ACT ACC 

Leu Ser Thr Thr 
40 



CAG AAA GGC AAC 

Gin Lys Gly Asn 
70 

CCC TTC AAA TGC 

Pro Phe Lys Cys 
85 

CTC ACC GGC CAC 
Leu Thr Gly His 



TCT GGA GCA CAG 
Ser Gly Ala Gin 

CCT TTC CAG TGC 

Pro Phe Gin Cys 
60 

CTC CTG CGG CAC 

Leu Leu Arg His 
75 

CAT CTT TGC AAC 

His Leu Cys Asn 
90 

CTG AGG ACG CAC 

Leu Arg Thr His 
105 



TCA GGA AAG GAG 

Ser Gly Lys Glu 
15 

GAG CCC ATG CCT 

Glu Pro Met Pro 
30 

CAG AAC TCC AAG 

Gin Asn Ser Lys 
45 

AAC CAG TCT GGG 
Asn Gin Ser Gly 

ATC AAG CTG CAC 

lie Lys Leu His 
80 

TAT GCC TGC CGC 

Tyr Ala Cys Arg 
95 

TCC GTC ATT AAG 

Ser Val lie Lys 
110 



AGT GAT CGA GGC ATG GGT GAA CGG 
192- - ■ - - 

Ser Asp Arg Gly Met Gly Glu Arg 
50 55 



GAA GAA ACT AAC CAC AAC GAG ATG GCA GAA GAC CTG TGC AAG ATA GGA 
3 84 

Glu Glu Thr Asn His Asn Glu Met Ala Glu Asp Leu Cys Lys lie Gly 
115 120 125 
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GCA GAG AGG TCC CTT GTC CTG GAC ■ AGG CTG GCA AGC AAT GTC GCC AAA 
432 

Ala Glu Arg Ser Leu Val Leu Asp Arg Leu Ala Ser Asn Val Ala Lys 
130 135 140 

CGT AAG AGC TCT ATG CCT CAG AAA TTT CTT GGA GAC AAG TGC CTG TCA 
480 

Arg Lys Ser Ser Met Pro Gin Lys Phe Leu Gly Asp Lys Cys Leu Ser 
145 150 155 160 

GAC ATG CCC TAT GAC AGT GCC AAC TAT GAG AAG GAG GAT ATG ATG ACA 
528 

Asp Met Pro Tyr Asp Ser Ala Asn Tyr Glu Lys Glu Asp Met Met Thr 
165 170 175 

TCC CAC GTG ATG GAC CAG GCC ATC AAC AAT GCC ATC AAC TAC CTG GGG 
576 

Ser His Val Met Asp Gin Ala lie Asn Asn Ala He Asn Tyr Leu Gly 
180 185 190 

GCT GAG TCC CTG CGC CCA TTG GTG CAG ACA CCC CCC GGT AGC TCC GAG 
624 

Ala Glu Ser Leu Arg Pro Leu Val Gin Thr Pro Pro Gly Ser Ser Glu 
195 200 205 

GTG GTG CCA GTC ATC AGC TCC ATG TAC CAG CTG CAC AAG CCC CCC TCA 
672 

Val Val Pro Val He Ser Ser Met Tyr Gin Leu His Lys Pro Pro Ser 
210 215 220 

GAT GGC CCC CCA CGG TCC AAC CAT TCA GCA CAG GAC GCC GTG GAT AAC 
720 

Asp Gly Pro Pro Arg Ser Asn His Ser -Ala Gin Asp Ala Val Asp Asn 

225 230 235 240 

TTG CTG CTG CTG TCC AAG GCC AAG TCT GTG TCA TCG GAG CGA GAG GCC 
768 

Leu Leu Leu Leu Ser Lys Ala Lys Ser Val Ser Ser Glu Arg Glu Ala 
245 250 255 

TCC CCG AGC AAC AGC TGC CAA GAC TCC ACA GAT ACA GAG AGC AAC GCG 
816 

Ser Pro Ser Asn Ser Cys Gin Asp Ser Thr Asp Thr Glu Ser Asn Ala 
260 265 * 270 

GAG GAA CAG CGC AGC GGC CTT ATC TAC CTA ACC AAC CAC ATC AAC CCG 
864 

Glu Glu Gin Arg Ser Gly Leu He Tyr Leu Thr Asn His He Asn Pro 
275 280 285 

CAT GCA CGC AAT GGG CTG GCTCTC AAG GAG GAG CAG CGC GCC TAC GAG 
912 

His Ala Arg Asn Gly Leu Ala Leu Lys Glu Glu Gin Arg Ala Tyr Glu 
290 295 300 

GTG CTG AGG GCG GCC TCA GAG AAC TCG CAG GAT GCC TTC CGT GTG GTC 
960 



- 123 - 



Val Leu Arg Ala Ala Ser Glu Asn Ser Gin Asp Ala Phe Arg Val Val 
305 310 315 320 

AGC ACG AGT GGC GAG CAG CTG AAG GTG TAC AAG TGC GAA CAC TGC CGC 
1008 

Ser Thr Ser Gly Glu Gin Leu Lys Val Tyr Lys Cys Glu His Cys Arg 
325 330 335 

GTG CTC TTC CTG GAT CAC GTC ATG TAT ACC ATT CAC ATG GGC TGC CAT 
1056 

Val Leu Phe Leu Asp His Val Met Tyr Thr lie His Met Gly Cys His 
340 345 350 

GGC TGC CAT GGC TTT CGG GAT CCC TTT GAG TGT AAC ATG TGT GGT TAT 
1104 

Gly Cys His Gly Phe Arg Asp Pro Phe Glu Cys Asn Met Cys Gly Tyr 
355 360 365 

CAC AGC CAG GAC AGG TAC GAG TTC TCA TCC CAT ATC ACG CGG GGG GAG 
1152 

His Ser Gin Asp Arg Tyr Glu Phe Ser Ser His lie Thr Arg Gly Glu 
370 375 380 

CAT CGT TAC CAC CTG AGC 
1170 

His Arg Tyr His Leu Ser 
385 390 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 8 base pairs 

(B) TYPE: nucleic acid _ . 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1128 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATG GAT GTC GAT GAG GGT CAA GAC ATG TCC CAA GTT TCA GGA AAG GAG 
48 

Met Asp Val Asp Glu Gly Gin Asp Met Ser Gin Val Ser Gly Lys Glu 
15 10 15 

AGC CCC CCA GTC AGT GAC ACT CCA GAT GAA GGG GAT GAG CCC ATG CCT 
96 

Ser Pro Pro Val Ser Asp Thr Pro Asp Glu Gly Asp Glu Pro Met Pro 
20 25 30 



- 124- 

GTC CCT GAG GAC CTG TCC ACT ACC TCT GGA GCA CAG CAG AAC TCC AAG 
144 

Val Pro Glu Asp Leu Ser Thr Thr Ser Gly Ala Gin Gin Asn Ser Lys 
35 40 45 

AGT GAT CGA GGC ATG GCC AGT AAT GTT AAA GTA GAG ACT CAG AGT GAT 
192 

Ser Asp Arg Gly Met Ala Ser Asn Val Lys Val Glu Thr Gin Ser Asp 
50 55 60 

GAA GAG AAT GGG CGT GCC TGT GAA ATG AAT GGG GAA GAA TGT GCA GAG 
240 

Glu Glu Asn Gly Arg Ala Cys Glu Met Asn Gly Glu Glu Cys Ala Glu 
65 70 75 80 

GAT TTA CGA ATG CTT GAT GCC TCG GGA GAG AAA ATG AAT GGC TCC CAC 
288 

Asp Leu Arg Met Leu Asp Ala Ser Gly Glu Lys Met Asn Gly Ser His 
85 90 95 

AGG GAC CAA GGC AGC TCG GCT TTG TCA GGA GTT GGA GGC ATT CGA CTT 
336 

Arg Asp Gin Gly Ser Ser Ala Leu Ser Gly Val Gly Gly lie Arg Leu 
100 105 110 

CCT AAC GGA AAA CTA AAG TGT GAT ATC TGT GGG ATC GTT TGC ATC GGG 
384 

Pro Asn Gly Lys Leu Lys Cys Asp lie Cys Gly lie Val Cys lie Gly 
115 120 125 

CCC AAT GTG CTC ATG GTT CAC AAA AGA AGT CAT ACT GGA GAC AAG TGC 
432 

Pro Asn Val Leu Met Val' His Lys Arg Ser His Thr Gly Asp Lys Cys 
130 135 140 

CTG TCA GAC ATG CCC TAT GAC AGT GCC AAC TAT GAG AAG GAG GAT ATG 
480 

Leu Ser Asp Met Pro Tyr Asp Ser Ala Asn Tyr Glu Lys Glu Asp Met 
145 150 155 160 

ATG ACA TCC CAC GTG ATG GAC CAG GCC ATC AAC AAT GCC ATC AAC TAC 
528 

Met Thr Ser His Val Met Asp Gin Ala lie Asn Asn Ala lie Asn Tyr 

165 170 175 

CTG GGG GCT GAG TCC CTG CGC CCA TTG GTG CAG ACA CCC CCC GGT AGC 
576 

Leu Gly Ala Glu Ser Leu Arg Pro Leu Val Gin Thr Pro Pro Gly Ser 
180 185 190 

TCC GAG GTG GTG CCA GTC ATC AGC TCC ATG TAC CAG CTG CAC AAG CCC 
624 

Ser Glu Val Val Pro Val lie Ser Ser Met Tyr Gin Leu His Lys Pro 
195 200 205 

CCC TCA GAT GGC CCC CCA CGG TCC AAC CAT TCA GCA CAG GAC GCC GTG 
672 



- 125- 

Pro Ser Asp Gly Pro Pro Arg Ser Asn His Ser Ala Gin Asp Ala Val 
210 215 220 

GAT AAC TTG CTG CTG CTG TCC AAG GCC AAG TCT GTG TCA TCG GAG CGA 
5 720 

Asp Asn Leu Leu Leu Leu Ser Lys Ala Lys Ser Val Ser Ser Glu Arg 
225 230 235 240 

GAG GCC TCC CCG AGC AAC AGC TGC CAA GAC TCC ACA GAT ACA GAG AGC 
10 768 

Glu Ala Ser Pro Ser Asn Ser Cys Gin Asp Ser Thr Asp Thr Glu Ser 
245 250 255 

AAC GCG GAG GAA CAG CGC AGC GGC CTT ATC TAC CTA ACC AAC CAC ATC 
15 816 

Asn Ala Glu Glu Gin Arg Ser Gly Leu lie Tyr Leu Thr Asn His lie 
260 265 270 

AAC CCG CAT GCA CGC AAT GGG CTG GCT CTC AAG GAG GAG CAG CGC GCC 
20 864 

Asn Pro His Ala Arg Asn Gly Leu Ala Leu Lys Glu Glu Gin Arg Ala 

275 280 285 

TAC GAG GTG CTG AGG GCG GCC TCA GAG AAC TCG CAG GAT GCC TTC CGT 
25 912 

Tyr Glu Val Leu Arg Ala Ala Ser Glu Asn Ser Gin Asp Ala Phe Arg 
290 295 300 

GTG GTC AGC ACG AGT GGC GAG CAG CTG AAG GTG TAC AAG TGC GAA CAC 
30 960 

Val Val Ser Thr Ser Gly Glu Gin Leu Lys Val Tyr Lys Cys Glu His 
305 310 315 320 

TGC CGC GTG CTC TTC CTG GAT CAC GTC ATG TAT ACC ATT CAC ATG GGC 
35 1008 

Cys Arg Val Leu Phe Leu Asp His Val Met Tyr Thr lie His Met Gly 

325 330 335 

TGC CAT GGC TGC CAT GGC TTT CGG GAT CCC TTT GAG TGT AAC ATG TGT 
40 1056 

Cys His Gly Cys His Gly Phe Arg Asp Pro Phe Glu Cys Asn Met Cys 
340 345 350 

GGT TAT CAC AGC CAG GAC AGG TAC GAG TTC TCA TCC CAT ATC ACG CGG 
45 1104 

Gly Tyr His Ser Gin Asp Arg Tyr Glu Phe Ser Ser His lie Thr Arg 
355 360 365 

GGG GAG CAT CGT TAC CAC CTG AGC 
50 1128 

Gly Glu His Arg Tyr His Leu Ser 

370 375 



55 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1004 base pairs 



- 126 



(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 1002 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGA GAA CGG CCC TTC CAG TGC AAT CAG TGC GGG GCC TCA TTC ACC CAG 
48 

Gly Glu Arg Pro Phe Gin Cys Asn Gin Cys Gly Ala Ser Phe Thr Gin 

15 10 15 

AAG GGC AAC CTG CTC CGG CAC ATC AAG CTG CAT TCC GGG GAG AAG CCC 
96 

Lys Gly Asn Leu Leu Arg His lie Lys Leu His Ser Gly Glu Lys Pro 
20 25 30 

TTC AAA TGC CAC CTC TGC AAC TAC GCC TGC CGC CGG AGG GAC GCC CTC 
144 

Phe Lys Cys His Leu Cys Asn Tyr Ala Cys Arg Arg Arg Asp Ala Leu 
35 40 45 

ACT GGC CAC CTG AGG ACG CAC TCC GTC ATT AAA GAA GAA ACT AAG CAC 
192 

Thr Gly His Leu Arg Thr His Ser Val lie Lys Glu Glu Thr Lys His 

50 " " 55 -60 - - 

AGT GAA ATG GCA GAA GAC CTG TGC AAG ATA GGA TCA GAG AGA TCT CTC 
240 

Ser Glu Met Ala Glu Asp Leu Cys Lys lie Gly Ser Glu Arg Ser Leu 

65 70 75 80 

GTG CTG GAC AGA CTA GCA AGT AAT GTC GCC AAA CGT AAG AGC TCT ATG 
288 

Val Leu Asp Arg Leu Ala Ser Asn Val Ala Lys Arg Lys Ser Ser Met 

85 90 95 

CCT CAG AAA TTT CTT GGG GAC AAG GGC CTG TCC GAC ACG CCC TAC GAC 
336 

Pro Gin Lys Phe Leu Gly Asp Lys Gly Leu Ser Asp Thr Pro Tyr Asp 

100 105 110 

AGT GCC ACG TAC GAG AAG GAG AAC GAA ATG ATG AAG TCC CAC GTG ATG 
384 

Ser Ala Thr Tyr Glu Lys Glu Asn Glu Met Met Lys Ser His Val Met 
115 120 125 

GAC CAA GCC ATC AAC AAC GCC ATC AAC TAC CTG GGG GCC GAG TCC CTG 
432 

Asp Gin Ala lie Asn Asn Ala lie Asn Tyr Leu Gly Ala Glu Ser Leu 



130 

CGC CCG CTG GTG 
480 

Arg Pro Leu Val 
145 

ATC AGC CCG ATG 
528 

lie Ser Pro Met 



TCC AAC CAC TCG 
576 

Ser Asn His Ser 
180 

TCC AAG GCC AAG 
624 

Ser Lys Ala Lys 
195 

AGC TGC CAA GAC 
672 

Ser Cys Gin Asp 
210 

AGC GGT CTT ATC 
720 

Ser Gly Leu lie 
225 

GTG TCG CTC AAG 
768 

Val Ser Leu Lys 



TCC GAG AAC TCG 
816 

Ser Glu Asn Ser 
260 

CAG ATG AAG GTG 
864 

Gin Met Lys Val 
275 

CAC GTC ATG TAC 
912 

His Val Met Tyr 
290 

TTT GAG TGC AAC 
960 

Phe Glu Cys Asn 
305 



135 

CAG ACG CCC CCG 

Gin Thr Pro Pro 
150 

TAC CAG CTG CAC 

Tyr Gin Leu His 
165 

GCC CAG GAC AGC 
Ala Gin Asp Ser 

TTG GTG CCC TCG 

Leu Val Pro Ser 
200 

TCC ACG GAC ACC 

Ser Thr Asp Thr 
215 

TAC CTG ACC AAC 

Tyr Leu Thr Asn 
230 

GAG GAG CAC CGC 

Glu Glu His Arg 
245 

CAG GAC GCG CTC 
Gin Asp Ala Leu 

TAC AAG TGC GAA 

Tyr Lys Cys Glu 
280 

ACC ATC CAC ATG 

Thr lie His Met 
295 

ATG TGC GGC TAC 

Met Cys Gly Tyr 
310 



- 127- 

140 

GGC GGT TCC GAG 

Gly Gly Ser Glu 
155 

AGG CGC TCG GAG 

Arg Arg Ser Glu 
170 

GCC GTG GAG TAC 

Ala Val Glu Tyr 
185 

GAG CGC GAG GCG 

Glu Arg Glu Ala 

GAG AGC AAC AAC 

Glu Ser Asn Asn 
220 

CAC ATC GCC CGA 

His lie Ala Arg 
235 

GCC TAC GAC CTG 

Ala Tyr Asp Leu 
250 

CGC GTG GTC AGC 

Arg Val Val Ser 
265 

CAC TGC CGG GTG 
His Cys Arg Val 

GGC TGC CAC GGC 

Gly Cys His Gly 
300 

CAC AGC CAG GAC 

His Ser Gin Asp 
315 



GTG GTC CCG GTC 

Val Val Pro Val 
160 

GGC ACC CCG CGC 

Gly Thr Pro Arg 
175 

CTG CTG CTG CTC 

Leu Leu Leu Leu 
190 

TCC CCG AGC AAC 

Ser Pro Ser Asn 
205 

GAG GAG CAG CGC 
Glu Glu Gin Arg 

CGC GCG CAA CGC 

Arg Ala Gin Arg 
240 

CTG CGC GCC GCC 

Leu Arg Ala Ala 
255 

ACC AGC GGG GAG 

Thr Ser Gly Glu 
270 

CTC TTC CTG GAT 

Leu Phe Leu Asp 
285 

TTC CGT GAT CCT 
Phe Arg Asp Pro 

CGG TAC GAG TTC 

Arg Tyr Glu Phe 
320 



- 128 - 



TCG TCG CAC ATA ACG 
1004 

Ser Ser His lie Thr 
325 



CGA GGG GAG CAC CGC 

Arg Gly Glu His Arg 
330 



TTC CAC ATG AGC TA 
Phe His Met Ser 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: C- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Xaa Xaa Ala Ser Asn Val Lys Val Glu Thr Gin Ser Asp Glu Glu Asn 
15 10 15 

Gly Arg Ala Cys Glu Met Asn Gly Glu Glu Cys Ala Glu Asp Leu Arg 
20 25 30 

Met Leu Asp Ala Ser Gly Glu Lys Met Asn Gly Ser His Arg Asp Gin 
35 40 45 

Gly Ser Ser Ala Leu Ser Gly Val Gly Gly lie Arg Leu Pro Asn Gly 
50 55 60 

Lys Leu Lys Cys Asp lie Cys Gly lie Xaa Cys lie Gly Pro Asn Val 
65 70 75 80 

Leu Met Val His Lys Arg Ser His Thr Gly Glu Arg Pro Phe Gin Cys 
85 90 95 

Asn Gin Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu Leu Arg His 
100 105 110 

lie Lys Leu His Ser Gly Glu Lys Pro Phe Lys Cys His Leu Cys Asn 
115 120 125 

Tyr Ala Cys Arg Arg Arg Asp Ala Leu Thr Gly His Leu Arg Thr His 
130 135 140 

Ser Val Gly Lys Pro His Lys Cys Gly Tyr Cys Gly Arg Ser Tyr Lys 
145 150 155 160 

Gin Arg Xaa Ser Leu Glu Glu His Lys Glu Arg Cys His Asn Tyr Leu 
165 170 175 



Glu Ser Met Gly Leu Pro Gly Xaa Xaa Xaa Pro Val lie Lys Glu Glu 
180 185 190 



- 129- 



Thr Xaa His Xaa Glu Met Ala Glu Asp Leu Cys Lys lie Gly Xaa Glu 
195 200 205 

Arg Ser Leu Val Leu Asp Arg Leu Ala Ser Asn Val Ala Lys Arg Lys 
210 215 220 

Ser Ser Met Pro Gin Lys Phe Leu Gly Asp Lys Xaa Leu Ser Asp Xaa 
225 230 235 240 

Pro Tyr Asp Ser Ala Xaa Tyr Glu Lys Glu Xaa Xaa Met Met Xaa Ser 
245 250 255 

His Val Met Asp Xaa Ala lie Asn Asn Ala lie Asn Tyr Leu Gly Ala 
260 265 270 

Glu Ser Leu Arg Pro Leu Val Gin Thr Pro Pro Gly Xaa Ser Glu Val 
275 280 285 

Val Pro Val lie Ser Pro Met Tyr Gin Leu His Xaa Xaa Xaa Ser Xaa 
290 295 300 

Gly Xaa Pro Arg Ser Asn His Ser Ala Gin Asp Xaa Ala Val Xaa Xaa 
305 . 310 315 320 

Leu Leu Leu Leu Ser Lys Ala Lys Xaa Val Xaa Ser Glu Arg Glu Ala 
325 330 335 

Ser Pro Ser Asn Ser Cys Gin Asp Ser Thr Asp Thr Glu Ser Asn Xaa 
340 345 350 

Glu Glu Gin Arg Ser Gly Leu lie Tyr Leu Thr Asn His lie Xaa Xaa 
355 360 365 

Xaa Ala Xaa Xaa Xaa Xaa Xaa Leu Lys Glu Glu Xaa Arg Ala Tyr Xaa 
370 375 380 

Xaa Leu Arg Ala Ala Ser Glu Asn Ser Gin Asp Ala Xaa Arg Val Val 
385 390 395 400 

Ser Thr Ser Gly Glu Gin Xaa Lys Val Tyr Lys Cys Glu His Cys Arg 
405 410 415 

Val Leu Phe Leu Asp His Val Met Tyr Thr lie His Met Xaa Xaa Xaa 
420 425 430 

Gly Cys His Gly Phe Arg Asp Pro Phe Glu Cys Asn Met Cys Gly Tyr 
435 440 445 

His Ser Gin Asp Arg Tyr Glu Phe Ser Ser His lie Thr Arg Gly Glu 
450 455 460 



His Arg Xaa His Xaa Ser 
465 470 



